Goto

Collaborating Authors

 low-resource language



Multilingual Pre-training with Universal Dependency Learning

Neural Information Processing Systems

The pre-trained language model (PrLM) demonstrates domination in downstream natural language processing tasks, in which multilingual PrLM takes advantage of language universality to alleviate the issue of limited resources for low-resource languages. Despite its successes, the performance of multilingual PrLM is still unsatisfactory, when multilingual PrLMs only focus on plain text and ignore obvious universal linguistic structure clues. Existing PrLMs have shown that monolingual linguistic structure knowledge may bring about better performance. Thus we propose a novel multilingual PrLM that supports both explicit universal dependency parsing and implicit language modeling. Syntax in terms of universal dependency parse serves as not only pre-training objective but also learned representation in our model, which brings unprecedented PrLM interpretability and convenience in downstream task use. Our model outperforms two popular multilingual PrLM, multilingual-BERT and XLM-R, on cross-lingual natural language understanding (NLU) benchmarks and linguistic structure parsing datasets, demonstrating the effectiveness and stronger cross-lingual modeling capabilities of our approach.


Extraction

Neural Information Processing Systems

Figure 5 shows an schema explaining the extraction of the entities. Each step is depicted in a triplet format: subject,predicate,object . Blue (italics) information is the information extracted at each step. For each step outlined with a dotted rectangle (), the information extracted is the subject; otherwise, the information extracted is the object. Figure 6 show an example of multilingual alignment for the languages considered in the high-resource use case: English, Arabic, Spanish and Russian.


MindMerger: Efficiently Boosting LLM Reasoning in non-English Languages

Neural Information Processing Systems

Reasoning capabilities are crucial for Large Language Models~(LLMs), yet a notable gap exists between English and non-English languages. To bridge this disparity, some works fine-tune LLMs to relearn reasoning capabilities in non-English languages, while others replace non-English inputs with an external model's outputs such as English translation text to circumvent the challenge of LLM understanding non-English.







09933f07ae2ccbca7212bb4e43de8db0-Supplemental-Datasets_and_Benchmarks.pdf

Neural Information Processing Systems

After annotating the entire dataset in each language, there was an additional annotator for each language who reviewed the entire set. Annotators were volunteers, and theyare acknowledged at theendofthiswork.